Evolutionary and Case-Based Approaches to REG: NIL-UCM-EvoTAP, NIL-UCM-ValuesCBR and NIL-UCM-EvoCBR
نویسندگان
چکیده
We propose the use of evolutionary algorithms (EAs) (Holland, 1992) to deal with the attribute selection task of referring expression generation. Evolutionary algorithms operate over a population of individuals (possible solutions for a problem) that evolve according to selection rules and genetic operators. The fitness function is a metric that evaluates each of the possible solutions, ensuring that the average adaptation of the population increases each generation. Repeating this process hundreds or thousands of times leads to very good solutions for the problem. We encode as a fitness function the specific constraints required for the reference to be acceptable. The crossover and mutation genetic operators ensure a reasonable variation between the different options much as a human-generated text would. Each individual is represented by a set of genes that are the list of possible attributes in the reference. Each gene has an associated value of 0 (if the attribute is not included in the reference), or 1 (if the attribute is included in the reference). The initial population should have a low number of genes set to 1, because references tend to be short and the use of all the possible attributes should be avoided. For the crossover operator, two individuals are selected randomly and crossed by a random point of their structure. For the mutation operator, some of the genes are chosen randomly to be mutated from 1 to 0, or vice versa. The fitness function must find a balance between the univocal identification of a referent, and a natural use of attributes. The formula used as fitness function is defined in Equation 1: fitindi = fatti∗weightatt+ident∗weightid (1)
منابع مشابه
NIL-UCM: Most-Frequent-Value-First Attribute Selection and Best-Scoring-Choice Realization
The NIL entry for the challenge has been constructed upon the general architecture for developing Natural Language Generation systems provided by the TAP project (Gervás, 2007). TAP (Text Arranging Pipeline) is a set of interfaces that define generic functionality for a pipeline of tasks oriented toward natural language generation, from an initial conceptual input to surface realization as a st...
متن کاملGeneration of test purposes from Use Case Maps
The Use Case Map (UCM) scenario notation can be used to model service requirements and high-level designs for reactive and distributed systems. It is therefore a natural candidate for use in the process of generating requirementsdirected test suites. We survey several approaches for deriving test purposes from UCM models. We distinguish three main approaches. The first approach is based on test...
متن کاملSpam and Beyond: An Information-Economic Analysis of Unwanted Commercial Messages
The phenomenon of unwanted commercial messages (UCM), including e-mail spam and emerging forms that target other Internet communications facilities, is analyzed from an information-economics perspective. The UCM industry is characterized as a tragedy of the commons. UCM traffic pays off for its senders when it is noticed and consumed by Internet users, and the industry is, therefore, dependent ...
متن کاملUCM-Based Generation of Test Goals
The Use Case Map (UCM) scenario notation can be used to model service requirements and high-level designs for reactive and distributed systems. It is then a natural candidate for use in the process of generating requirement-directed test suites. We survey several approaches for deriving test goals from UCM models. We distinguish three main approaches. The first approach is based on testing patt...
متن کاملMeta-model Based Development of Use Case Model for Business Function
Abstract. The principles and major steps of Enterprise Meta-Model (EMM) based development of Use Case model (UCM) in CASE system environment are presented in this paper. The Enterprise Meta-Model represents the key concepts of domain knowledge. The enterprise processes, management functions, and their interactions are considered as the critical components of the domain knowledge accumulated as ...
متن کامل